Quikr: a method for rapid reconstruction of bacterial communities via compressive sensing

نویسندگان

  • David Koslicki
  • Simon Foucart
  • Gail L. Rosen
چکیده

MOTIVATION Many metagenomic studies compare hundreds to thousands of environmental and health-related samples by extracting and sequencing their 16S rRNA amplicons and measuring their similarity using beta-diversity metrics. However, one of the first steps--to classify the operational taxonomic units within the sample--can be a computationally time-consuming task because most methods rely on computing the taxonomic assignment of each individual read out of tens to hundreds of thousands of reads. RESULTS We introduce Quikr: a QUadratic, K-mer-based, Iterative, Reconstruction method, which computes a vector of taxonomic assignments and their proportions in the sample using an optimization technique motivated from the mathematical theory of compressive sensing. On both simulated and actual biological data, we demonstrate that Quikr typically has less error and is typically orders of magnitude faster than the most commonly used taxonomic assignment technique (the Ribosomal Database Project's Naïve Bayesian Classifier). Furthermore, the technique is shown to be unaffected by the presence of chimeras, thereby allowing for the circumvention of the time-intensive step of chimera filtering. AVAILABILITY The Quikr computational package (in MATLAB, Octave, Python and C) for the Linux and Mac platforms is available at http://sourceforge.net/projects/quikr/.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Block-Based Compressive Sensing Using Soft Thresholding of Adaptive Transform Coefficients

Compressive sampling (CS) is a new technique for simultaneous sampling and compression of signals in which the sampling rate can be very small under certain conditions. Due to the limited number of samples, image reconstruction based on CS samples is a challenging task. Most of the existing CS image reconstruction methods have a high computational complexity as they are applied on the entire im...

متن کامل

Image Compressive Sensing Recovery Using Group Sparse Coding via Non-convex Weighted Lp Minimization

Compressive sensing (CS) has attracted considerable research from signal/image processing communities. Recent studies further show that structured or group sparsity often leads to more powerful signal reconstruction techniques in various CS taskes. Unlike the conventional sparsity-promoting convex regularization methods, this paper proposes a new approach for image compressive sensing recovery ...

متن کامل

Bacterial Community Reconstruction Using A Single Sequencing Reaction

Bacteria are the unseen majority on our planet, with millions of species and comprising most of the living protoplasm. While current methods enable in-depth study of a small number of communities, a simple tool for breadth studies of bacterial population composition in a large number of samples is lacking. We propose a novel approach for reconstruction of the composition of an unknown mixture o...

متن کامل

Sensing Matrix Design via Capacity Maximization for Block Compressive Sensing Applications

It is well established in the compressive sensing (CS) literature that sensing matrices whose elements are drawn from independent random distributions exhibit enhanced reconstruction capabilities. In many CS applications, such as electromagnetic imaging, practical limitations on the measurement system prevent one from generating sensing matrices in this fashion. Although one can usually randomi...

متن کامل

Image Reconstruction based on Block-based Compressive Sensing

The data of interest are assumed to be represented as Ndimensional real vectors, and these vectors are compressible in some linear basis B, implying that the signals can be reconstructed accurately using only a small number of basis function coefficients associated with B. A new approach based on Compressive Sensing (CS) framework which is a theory that one may achieve an exact signal reconstru...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 29 17  شماره 

صفحات  -

تاریخ انتشار 2013